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Abstract. In this paper we consider the problem of inference on a class of sets de- 
scribing a collection of admissible models as solutions to a single smooth inequality. 
Classical and recent examples include, among others, the Hansen- Jagannathan (HJ) 
sets of admissible stochastic discount factors, Markowitz-Fama (MF) sets of mean- 
variances for asset portfolio returns, and the set of structural elasticities in Chetty 
(2012)'s analysis of demand with optimization frictions. We show that the econometric 
structure of the problem allows us to construct convenient and powerful confidence re- 
gions based upon the weighted likelihood ratio and weighted Wald (directed weighted 
Hausdorff) statistics. The statistics we formulate differ (in part) from existing statis- 
tics in that they enforce either exact or first order equivariance to transformations of 
parameters, making them especially appealing in the target applications. Moreover, 
the resulting inference procedures are also more powerful than the structured projec- 
tion methods, which rely upon building confidence sets for the frontier-determining 
sufficient parameters (e.g. frontier-spanning portfolios), and then projecting them to 
obtain confidence sets for HJ sets or MF sets. Lastly, the framework we put forward is 
also useful for analyzing intersection bounds, namely sets defined as solutions to multi- 
ple smooth inequalities, since multiple inequalities can be conservatively approximated 
by a single smooth inequality. We present two empirical examples that show how the 
new econometric methods are able to generate sharp economic conclusions. 
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1. Introduction 

In this paper we consider the problem of inference on a class of sets describing a 
collection of admissible models as solutions to a single smooth inequality: 

60 = G : m(^) < 0}, (1.1) 

where C M.^ is a compact parameter space, and m : R'^ 1— t- M is the inequality- 
generating smooth function, which is estimable from the data. This structure arises in a 
number of important examples in financial economics and public finance, as we explain 
below via a sequence of examples. We show that this structure leads to highly-tractable 
and powerful inference procedures, and demonstrate their usefulness in substantive em- 
pirical examples. Furthermore, the structure could be used to conservatively approxi- 
mate more complicated problems, where the sets of interest are given as intersections of 
solutions to multiple smooth inequalities. (Indeed, in the latter case, we can conserva- 
tively approximate the multiple inequalities by a single smooth inequality. The benefits 
from doing so is the highly tractable, "regular" inference.) 

Example 1: Mean- Va riance Set for the Stochastic Discount Factor. In order to describe 



the problem, we recall ICochrand (l2005l )'s assertion that the science of asset pricing could 



be effectively summarized using the following two equations: 

Pt = Ei[Mi+iXi+i], 
Mt+i = f{Zt^i, parameters), 

where Pt is an asset price, Xt+i is the asset payoff, Mt+i is the stochastic discount 
factor (SDF) or pricing kernel (PK), which is a function / of some data Zt+i and pa- 
rameters, and Et is the conditional expectation given information at time t. The set of 
SDFs Mt that can price existing assets generally form a proper set, i.e. a set that is 
not a singleton. SDFs are not unique as long as the existing payoffs to assets do not 
span the entire universe of possible random payoffs. Dynamic asset pricing models pro- 
vide families of potential SDFs, for example, the standard consumption model predicts 
that an appropriate SDF can be stated in terms of the intertemporal marginal rate of 
substitution: 

where u denotes a utility function parameterized by some parameters, Ct denotes con- 
sumption at time t, and /3 denotes the subjective discount factor. Note that the investor's 
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optimal consumption plan, and therefore the marginal rate of substitution, generally de- 
pend on a set of additional state variables Zj, including lifetime wealth and non-capital 
income. Hence, when markets are incomplete, asset prices alone do not in general pin 
down a unique stochastic discount factor across different aggregate states of the economy 
or across investors with different values of Z^. 

An important empirical problem in the context of the CAPM is to check which fami- 
lies of SDFs price the assets correctly and which do not. This reasoning forms the basis 
for many approaches to estimat ion and tests for particular specifications of an asset 
pricing model, most prominentlv iHansen and SingletonI (1l982l )'s seminal analysis of the 
consumption-based CAPM, see also Ludvigso'nr f 2012l ) for a survey of recent develop- 
ments on the subject. One leading approach for performing the check is to see whether 
the mean and standard deviation of SDFs {fi, a) of the stochastic discount factor Mt are 
admissible. Let i^' be a compact convex body in M^. The set of admissible means and 
standard deviations in K 



9o := { admissible pairs {fi, a"^ 



is introduced by lHansen and Jagannatharu (jl99ll ) and known as the Hansen- Jagannathan 
(HJ) set. The boundary of that set OQq is known as the HJ bound. In order to describe 
the admissible pairs in a canonical setting, let v and S denote the vector of mean returns 
and covariance matrix to assets 1, A^, which are assumed not to vary with information 
sets at each period t. In the following, we consider the "minimal sufficient parameters" : 



5,, 



v'Tj "^f, 5*^,1 = f'S ^1n, Sii = I'^S ""^l 



;i.2) 



where 1^ is a column vector of ones. Then the minimum variance crfijil^) achievable by 
a SDF given mean /i of the SDF is equal to 



• HJ (yu) = (1 - ^^v)' S ^ (1 - /it;) = S^^^i^ - 28^1^1 + 5*11. 
Therefore, the HJ set is equal to 

Qhj = {(/i, a) e (M xR+)nK: - a < 0}. 



m(6») 



That is. 



eo = {9eQ: mHj{9) < 0}. 
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Note that the inequahty-generating function mHj{0) depends on the unknown means fi 
and covariance of returns S via the minimal sufficient parameters, 7 = {S^, Syi, Su)' , 
so that we will write mHj{0) = mHj{0,'y). 



Example 2: Mean- Varia nce Analysis of Ass et Portfolios. Let us now describe the second 
problem. The classical iMarkowitzl ( 1l952l ) problem is to minimize the variance of a 
portfolio given some attainable level of return: 

mmEt[rp^t+i - 'E't[rp,t+i]T such that Ejrp.^+i] = /i, 

w 

where f+i is the return of the portfolio, determined as Tp^t+i = "w'rt+i, and w is a vector 
of portfolio "weights" and r^+i is a vector of returns on available assets. In a canonical 
version of the problem, we have that the vector of mean returns v and covariance of 
returns S do not vary with time period t, so that the problem becomes: 

<7m(/^) = minitj'Sw such that w'v = fi. 

An explicit solution for aMifJ') takes the form, 

2 / \ _ Siijj?' — 2Syijj, + 5*^^ 

where the minimal sufficient parameters 7 = {S^, S^i, Su)' are the same as in equation 

m- 

Therefore given a compact convex body K cM."^, the Markowitz (M) set of admissible 
standard deviations and means in K is given by 

Gm = {(/i, a) G (M X R+) n : (JmI/u) - a < 0} = G : mM(^) < 0}, 

where mM{0) '■= o"M(yu) — o"- The boundary of the set Qm is known as the efficient 
frontier. Note that as in HJ example, the inequality-generating function mM{d) = 
^m{(^i 7) depends on the unknown parameters, the means and covariance of returns, 7 = 
(S',,^,, 5*^1, 5*11)'. We also note that in some apphcations, the complementary Markowitz 
set is of interest: 

61, = {eeQ: -mM{e) < 0}. 

The inference on such sets thus also fall in our framework (11. ip . 

Mean-variance analysis for asset returns and the stochastic discount factor provides 
powerful tools to summarize the opportunities for risk diversification in a given asset 
market, and can serve as a basis for spanning and efficiency tests. The importance of 
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sampling error and the role of statistical in f erence in portfolio analysis has long been 
recognize d in the litera. t ure: Gibbons et al.l ( 119891 ) develop regression-based efficiency 



tests, and lBritten-Joned ( 119991 ) con siders inference on the tangency portfolio and tests for 
portfolio efficiency; recent work by lPenaranda and Sentanal (l2010l ) combines restrictions 
derived from the two dual approaches to mean- variance analysis in order to derive more 
powerful spanning tests for (sub-)mar kets; a sunimary of the recent literature on mean- 
variance efficiency tests is provided by lSentanal (120091 ) . The econometric contribution of 
this paper is quite distinct from and hence complementary to these previous efforts. 



Example 3: Multi-Factor Efficient Frontiers. A third example d u e to 



extends the unconditional portfolio choice problem in iMarkowitzl ( 1l952l ) to multiple 



Famal (119961 ) 



sources of priced risk. More specifically, suppose that there exist k state variables 
Zt = {zit,...,Zkty ("factors") such that {r'^, Z'^)' are jointly normal (or, more gener- 
ally, jointly spherical). We assume that Zt follows a i^'-variate standard normal (or 
other spherical) distribution, and 

Et[rt+^\Zt+,]=v + B'Zt+, 

where v is an A^-dimensional vector of average returns, and the k x N covariance matrix 
B = . . . , /^Tv] = Covt{Zt+i,rt+i) gives the factor loadings for the vector of asset 
returns. As in the previous examples, we denote the (unconditional) mean of asset 
returns by f G M^, and their covariance matrix with S. Note that for the purposes of 
this example, we take to denote the excess return of asset i relative to a risk-free 
asset with return Vf. 

This analysis is motivated by the investor's lifetime consumption-investment problem, 
where the state variables Zt denote other factors influencing prices for consumption 
goods and the household's income from wage labor or entrepreneurial activity in period 
t. In particular the household not only faces a trade-off between mean and variance of 
portfolio returns, but may also wish to use her portfolio to insure against price or income 
shocks. 

Specifically, we consider the problem constructing a minimum variance portfolio that 
targets a mean return fi* and a vector /3* of loadings respect to the factors Zt in the 
absence of borrowing constraints. Denoting D := [v, B'] and 6* := (/i*, /?*')', the efficient 
portfolio weights w can be found by solving the program 



min w'T^w 

w 



s.t. D'w = 6*. 



1.3) 
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Note that since the investor has access to the risk-free asset and faces no short-sale 
constraints, the portfoho weights w are not required to be nonnegative or sum up to 
one. Solving this program yields efficient portfolio weights w* = Il^^D{D'TiD)^^6* so 
that the lower bound on the variance of a portfolio with beta equal to f3 and mean n is 
given by 

aljj,{n,P) := {w*yEw* = 5*\D'^D)-H\ 
The resulting multi-factor efficient frontier bounds the cone 

Qmf := {(/i*, /?*>)' G (R'^+i X R+) n : aMF(/i*,/3*) - a < o} 

of feasible mean-variance-beta combinations, where K is some compact convex body 
in M'^"'"^. Again, this cone can also be characterized by an inequality condition on the 
moment function 



rriMF^e, 7) := ^ f3*'){D'^D)-\ii\ P*')' - a 
where 9 = {12* , /?*', a)' and 7 = vec(D'ED). 



Extending the two-fund theorem, iFamal (Il996l ) shows that we can always construct 



k + 2 different funds from the assets that span the entire multi-factor efficient frontier. 
In the presence of heterogeneity in preferences, wealth and consumption possibilities, 
different investors will typically choose different points on the efficient cone, so that 
only the entire set constitutes an adequate representation of the opportunities to insure 
against different factor risks in the market. 

Example 4- Bounds on Demand Elasticities. To illustrate the wider applicability of our 
methods, we also c onsider i nferen ce on demand elasticities in the presence of an op- 
timization friction. IChettyl (|2012[ ) considers a household's dynamic labor supply and 
consumption problem and proposes bounds on price elasticities that allow for optimiza- 
tion errors of unknown form but that are bounded in magnitude. His analysis assumes 
that preferences can be represented by quasilinear flow utilities and lead to Marshallian 
demand functions of the form 



log = a - e \ogp + V, 



it J 



where p is the price of the good of interest, and a preference shock for household i in 
period t. Here e is the (constant) "structural" demand elasticity corresponding to the 
flrst-best solution to the household's planning problem. 
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Let U*f. be the discounted lifetime utility from following the optimal consumption 
plan and let eit{x) denote the minimal expenditure needed to finance a utility level 
U*^ under the constraint that xu is held fixed at x. The analysis then considers "small" 
optimization errors in the households' observed choice of xu such that for every period t, 
the average utility loss measured in terms of the difference in expenditure as a percentage 
of the optimal budget is bounded by some 6 > 0, i.e. 

J_^jv eajxit) - eit{x*n{pt)) 



w here the average is taken over a sample of households i 



.N. 



Chettyl (I2OI2I ) derives the lower and upper bounds El and eu for the structural elas- 



ticities that are compatible an observed elasticity e° given a price change Alogp, and 
in the presence of an optimization error of 5 > 0. This bound is given by the following 
two inequalities 



eu-2 



{2eu5) 



1/2 



and e° = eL + 2 



1/2 



A log p A log p 

where 6 is assumed to be smallQ With a few basic manipulations, we can obtain the 
following equivalent characterization of the {6, £:)-set as a lower bound for the distortion 
needed to reconcile e° with a structural elasticity e: 



5 > 



8e 



--: 6of{£, A logp) if and only if e G [el, Eu]- 



;i-4) 



Chettyl (I2OI2I ) gives several interpretations for this optimization friction, which could 
result e.g. from adiustmen t costs or misperception of p rices. In the context of the 
consumption-CAPM model, iLettau and LudvigsonI (120091 ) conducted a similar sensitiv- 



ity analysis, allowing for optimization errors in terms of the Euler equations character- 
izing the household's optimal investment decision. 

The resulting set of structural elasticity/optimization friction pairs {e,S) compatible 
with the observed elasticity e° is given by 

GoF := {{e,6) e Rl : 6oF{e,e°,A\ogp) - 6 < O} . 



^See equations (12) and (13) on pages 983-985 in lChettvl (|2012l ) 
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The bounds in equation (11.41) can therefore also be expressed as an inequahty condition 
on the moment function 

^--e°)2(Alogp)2 



8e 



where 6 = {e,6) and 7 = e°, and Alogp will be regarded as a fixed design parameter. 
In applications, one might be interested in int ersections bou nds from several studies in 
order to obtain tighter bounds, as was done in Ichettyl ( 2012 ). As we discuss below and 
illustrate in the empirical example, it is possible to perform this intersection in a smooth 
fashion, retaining the general single-equation structure as given in (11. ip . 

1.1. Overview. The basic problem of this paper is to develop inference methods on 
sets defined by nonlinear inequality restrictions, as the HJ and M sets, while accounting 
for uncertainty in the estimation of parameters of the inequality-generating functions. 
Specifically, the problem is to construct a confidence region R such that 



lim P{eo C R} 



a, 



rf^^(eo,i?) = Op(i/y^), 



where dn is the Hausdorff distance, defined below in (I2.17p . We will construct con- 
fidence regions for the set Gq using LR and Wald-t ype Statistics, buildi r ig on and si- 
multaneously enriching the app roac hes suggested in IChernozhukov et al.l (120071 ) and in 



Beresteanu and Molinaril (120081 ) and iMolchanovl ( 119981 ). respectively. We also would like 
to ensure that confidence regions R are as small as possible and converge to Gq at the 
fastest attainable speed. 

Once R is constructed, we can test any composite hypotheses involving the parameter 
6 without compromising the significance level. E.g. a typical application of the HJ 
bounds determines which combinations of Ge the first two moments of the SDF generated 
by a given family of economic models fall in the HJ set. Indeed, in that case Gg = 
{fi{g), crig), Q G [0, 00)}, where g is the elasticity of power utility function, and we can 
check which values of 7 give us overlap of Gg with the confidence set R for the HJ set|^ 
Similar comments about applicability of our approach go through for the M and MF sets 
as well. We should also mention that our confidence regions R are also more powerful 
than the structured projection methods, which rely upon building confidence sets for the 
frontier-determining sufficient parameters (e.g. frontier-spanning portfolios), and then 
projecting them to obtain confidence sets for HJ sets or MF sets. We demonstrate this 



^In cases where curve g '-^ fJ^iff), g '-^ cr{g) are estimated, we need to construct a confidence set Re for 
and tlien look whetlier (and wliere) i?e and R overlap to determine the plausible values of g. 
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in the empirical section, where we compute the confidence sets for HJ sets based on our 
approach and based on structured projection approach, and show that our confidence 
interval is much smaller and lies strictly inside the confidence region based on structured 
projections. We provide a theoretical explanation to this phenomenon in Section 2.5 

Our procedure for inference using weighted Wald-type statist ics complements other ap 
proaches based on the directed Ha usdorff dista n ce sug gested in 



Beresteanu and Molinari 



( 20081 ) and Hausdorff distance in Molchanov ( 1998 ). By using weighting in the con- 
struction of the Wald-type statistics, we make this approach asymptotically invariant 
to parameter transformations, which results in noticeably sharper confidence sets, at 
least in the canonical empirical example that we will show. These invariance properties 
should be see n as complementary to qu estions of efficient estimation of the identified set 
as studied by lKaido and SantosI (l201ll ): indeed, many types of confidence sets will all be 
centered around the same efficient set estimate, and so we use invariance and precision 
considerations to select amongst various potential constructions of sets. 



Critchley et al.l ( 1l996l ) analyzed the geometric structure of Wald-type tests with non- 



linear restrictions for point-identified problems and pointed out the beneficial role of 
invariance of the distribution of the statistic with respect to reparameterizations. In 
regular problems, the effect of nonlinearities on inference is typically asymptotically 
negligible due to the delta method and its extensions. However, our results show that 
for inference on sets with a nontrivial diameter, the effect of nonlinearities is of the 
same order as that of sampling variation. In particular we propose inference procedures 
that rely on quantities that are asymptotically pivotal and (asymptotically) invariant to 
parameter transformation, and illustrate that procedures failing to meet these require- 
ments lead to overly conservative inference. In particular, these conditions are not met 
by the classical Hausdorff distance except in very special cases. 

Thus, our construction is of indep endent interest for this t y pe of infer ence and is 



Beresteanu and Molinari 



(EoosI) 



and 



Molchanov 



a usef ul complement to the work of 

(|l998k Furthermore, our results on formal validity of the bootstrap for the weighed LR- 
type and W-type statistics are also of independent i nterest (for the class of problems with 
structures that are simila r to those studied here; see lBeresteanu and Molinaril (120081 ) and 



Kaido and SantosI (120111 ) for related bootstrap results for Hausdorff statistics. 



The rest of the paper is organized as follows. In Section 2 we present our estimation 
and inference results. In Section 3 we present an empirical example, illustrating the 
constructions of confidence sets for HJ sets. In Section 4 we draw conclusions and 
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provide direction for further research. In the Appendix, we collect the proofs of the 
main results. 

2. Estimation and Inference Results 

2.1. Basic Constructions. We first introduce our basic framework. We have a real- 
valued inequality-generating function m{9), and the set of interest is the solution of the 
inequalities generated by the function m(^) over a compact parameter space © C R'^: 

Go = e e : m{e) < 0}. 

A natural estimator of ©o is its empirical analog 

©0 = e © : fh(e) < 0}, 

where m(^^) is the estimate of the inequality-generating function. For example, in the 
HJ and M examples, the estimate takes the form 

Remcirk 1 (Approximating Multiple Inequalities by A Single Smooth Inequality). 
Throughout the paper, we will only consider the case of a single inequality restric- 
tion, m : © — > R satisfying certain smoothness conditions with respect to 9. However 
this framework approximately encompasses the case of multiple inequalities, since it is 
possible to conservatively approximate intersection bounds of the form g{9) < 0, where 
g{9) :— {gi{9), . . . ,gj{9))' represents a vector of constraints, by a single smooth bound 
m{9) < 0. To be specific, we can form an inequality-generating function: 



m,{9) := m{gm. . . . , gj{9)- A) := E/^i ^f^^^^/f ),,, ^.(^), 

Ei=iexp(AyK^)) 



where A e R is a fixed, positive scalar. This function mx is clearly smooth with respect 
to the underlying function g, and is a conservative approximation to maxi<j<j gj{9) with 
an explicit error bound, as shown by the following lemma. 

Lemma 1 (Properties of Smooth Max), (i) We have that 

m(gi, ...,gj,X) <0 if max gj (9) < 0. 
(a) For any A > 0, the approximation error obeys 

sup |max(gri,...,grj) -m{gi,...,gj,\)\ < \~^W (- — -) 

(gi,...,9^)'eR^ V e / 
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where e is Euler's number, and the function ly : M i— ?► M Lambert's product logarithm 
function, i.e. W{z) is defined as the solution w of the equation wexp{w} = z. In 
particular, W{z) >Oifz>0, and W{z) < log{z) if z >e. 

This lemma implies that the approximation error relative to the max function (with 
respect to the sup- norm) is inversely proportional to the smoothing parameter A, where 
the constant of proportionality grows only very slowly in J. In the paper we will state 
our results and assumptions on estimation and inference directly in terms of the scalar 
moment m{9) = m\{6) and its empirical analog m(6') := mx{0) := m{'gi{9), . . . ,'gj{9); A). 
We also consider the quantity 1/A fixed at some small value. □ 

Our proposals for confidence regions are based on (1) a LR-type statistic and (2) a 
Wald-type statistic. The LR-based confidence region is 

Rlr ■.= {eee: [V^m{9)/m]l < Ml " «)} , (2-1) 

where 's{6) is a weighting function; ideally, the standard error of m{6); and k{l — a) is 
some suitable estimate of k{l — a), the (1 — a;)-quantile of the statistic 

Cn= snp[V^m{e)/sie)]l. (2.2) 
eeeo 



Note that is a LR-type statistic, as in IChernozhukov et al.l ( 120071 ) 



Next we shall consider confidence regions based on inverting a Wald-type statistic. In 
order to eliminate boundary effects on the distribution of this statistic, we will need to 
assume that the moment conditions, and therefore the set estimator, are well-defined in 
a neighborhood of the parameter space. To this end, we define the distance of a point 6 
to a set y4 C R'' as 

d(e,A) := inf 11^ -^'11. 

We let the set 

:= {oeR'' : d{e,e) < 6} 

denote the 5-expansion of in M'^ for any 6 > 0. We denote the natural set estimator 
for 00 in the expansion of the parameter space by 

00,5 := {eee': m{e) < 0} . 

We will see below that the behavior of the moment function near the boundary of the 
parameter space is in general relevant for the statistical behavior of our procedure. 
However, for the practical applications discussed in this paper, the restriction of the 
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parameter space 6 to a compact set is entirely for technical reasons, so that an extension 
of the definitions to G*^ is unproblematic. 

Given these definitions, our Wald-based confidence region is 

Rw={eee: {^d{e, eo,^)^^)]' < k{i - «)}, (2.3) 

where w{9) is a weighting function, particular forms of which we will suggest later; and 
k{l — a) is a suitable estimate of k{l — a), the (1 — a)-quantile of W„, where W„ is the 
weighted W-statistic 

Wn ■■= sup [^d{e, Q^^s)lw{e)f- (2-4) 
eeeo 

In the special case, where the weight function is fiat, namely w{d) = w for all 6, the W - 
stat istic yVn becomes the canonical directed Hausdorff distance (see 
and 



Beresteanu and Molinaril (120081 )): 



Molchanovl fllQQSh . 



v/y\4 ocrf+ (60,60,5) = sup inf \\9-9'\\. 

Remark 2 (Invariance Motivation for Weighed W-Statistics) . The weighted W-statistic 
(I2.4p is generally not a distance, but we argue in Section 2.6 that it provides a very useful 
extension/generalization of the canonical directed Hausdorff distance. Note that, like 
any Euclidean norm, the Hausdorff distance is not invariant with respect to changes 
in the scale of the different components of 6, which results in confidence regions that 
are not equivariant to such transformations. Our empirical results below illustrate that 
as a result, the shape of confidence sets alters dramatically as we change the weight 
on different dimensions of the parameter space. In sharp contrast, the introduction of 
certain types of weights makes the W-statistic (first-order) invariant to such transforma- 
tions, as shown in Section 2.6, which makes the resulting confidence regions (first-order) 
equivariant. As a result, in all of our empirical examples such weighting dramatically 
improves the confidence regions (e.g. compare Figures [5] and [6] in the example concerning 
inference on HJ sets.) □ 

2.2. A Basic Limit Theorem for LR and W statistics. In this subsection, we de- 
velop a basic result on the limit laws of the LR and W statistics. We will develop this 
result under the following general regularity conditions: 

Condition R. For some 6 > 0, the inequality- generating functions m : 6*^ — t- M and 
its estimator m : 6'' — ?■ M are well-defined. 
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(R.l) The estimator m is n'^/"^ -consistent for m and asymptotically Gaussian, namely, 
in the metric space of hounded functions £°°(6'^) 

\fn{m — m) G, 

where G is a zero-mean Gaussian process with continuous paths a.s., and a non- 
degenerate covariance function, i.e. infggea E[G^(6')] > 0. 
(R.2) Functions m and m admit continuous gradients V^m : i— t- M'^ and V^m : 
I— 7- M'^ with probability one, where 

Vem{e) = Vem{e) + Op{l). 

uniformly in 9 G 0^. The gradient 9 ^ Vgm is uniformly Lipschitz and 
infeg©^ \\Vem{9)\\ > 0. 
(R.3) Weighting functions s" : O"^ t— t- M and s : O'' t— t- M satisfy uniformly in 9 E 

s{9) = s{9) + Op{l), w{9)=w{9) + Op{l), 

where s : 6*^ i— t- M+ and w : M+ are continuous functions with values 

bounded away from zero. 



In Condition R.l, we require the estimates of the inequahty-generating functions to 
satisfy a uniform central hmit theorem. Man y sufficient conditions for th i s are p rovided 
by the theory of empirical processes, see e.g. Ivan der Vaart and Wellnerl ( 119961 ). In our 
finance examples, this condition will follow from asymptotic normality of the estimates of 
the mean returns and covariance of returns. In Condition R.2, we require that gradient 
of the estimate of the inequality-generating function be consistent for the gradient of 
the inequality-generating function. Moreover, we require that the norm ||Ve)m(^)|| be 
bounded away from zero, which is an identification condition and allows us to estimate, 
at a parametric rate, the boundary of the set 6o, which we define as 



dQo ■={9eQ: m{9) = 0}. 

In Condition R.3, we require that the estimates of the weight functions be consistent, 
and the weight functions be well-behaved. 

Under these conditions we can state the following general result. 
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Theorem 1. (Limit Laws of LR and W Statistics). Under Condition R 

> 1 2 



Cn C, C = sup 

>V„ W, W= sup 



(2.5) 



(2.6) 



[\\Vemie)\\-wie) 

where both W and C have distribution functions that are continuous at their (1 — a) 
quantiles for a < 1/2. Furthermore, if 

s{0) 



w{e) 



+Op(l), 



||V,m(^)|| 

uniformly in 9 E , then the two statistics are asymptotically equivalent: 

Wn = Cn + Op{l). 

In particular, this equivalence occurs if w{9) = ||V6im(^^)||/s'(^). 



We see from this theorem that the LR and W statistics converge in law to well-behaved 
random variables that are continuous transformations of the limit Gaussian process G. 
Moreover, we see that under an appropriate choice of the weighting functions, the two 
statistics are asymptotically equivalent. 

For our application to HJ and MF sets, the following conditions will be sufficient. 



Condition C. 

(C.l) We have that m{6) = m(6', 70), where 70 G F C M.'^, for all 9 E and some 
6 > 0. The value 70 is in the interior ofV , and there is an estimator^ of'jo that 
obeys 

v^(7-7o) ^fii/^Z, Z = iV(0,/,). 

for some positive- definite Hermitian matrix Vl. Moreover, there is Vt — T-p Vt. 
(C.2) The gradient map (^,7) ^ \/gm{9,'y), mapping x F 1— )■ M.'' , exists and is 

uniformly Lip schitz- continuous. Moreover, inf(g ,y)g0ixr II V6)W^(6', 7) || > 0. 
(C.3) The gradient map (^,7) ^ V^m(^,7), mapping 0"^ x F 1— t- R*^, exists and is 

uniformly Lip schitz- continuous. 

We show in Proposition [T] that these conditions hold for the canonical versions of the 
HJ and MF problems. Under these conditions we immediately conclude that in the 
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metric space of bounded functions i°°{Q^): 

v/n(m(-) - m(-)) = V^m(-,7)'v^(7-7o) + Op(l) 
w V,m(-,7o)'fii/2^, 

where Vj'm{6,'y) denotes the gradient with each of its rows evaluated at a value 7 on 
the line connecting 7 and 70, where value 7 may vary from row to row of the matrix. 
Therefore, the limit process in HJ and M examples takes the form: 

G(^) = V^m{e, -foY^^^^Z. (2.7) 

This will lead us to conclude formally below that conclusions of Theorem 1 hold with 

■V-,m(^,7o)'l]i/2 



sup 



sup 



V^m(^,7o)'fii/2 



(2i 



(2.9) 



\\Vemie,^o)\\-wie) 

A good strategy for choosing the weighting function for LR and W is to choose the 
studentizing Anderson-Darling weights 

s{6) = ||V^m(e,7o)'r]^/'||, (2.10) 
|V,m(^,7o)'^^^/'|| 



w{e) 



(2.11) 



l|V,m(^,7o)|| ■ 

The natural estimates of these weighting functions are given by the following plug-in 
estimators: 



s{e) := ||V^m(e,7)'ni/2| 
^(^) l|V,m(^,7)'^^^/^| 



l|V,m(^,7)|| 

We formalize the preceding discussion as the following corollary. 



(2.12) 
(2.13) 



Corollary 1. (Limit Laws of LR and W statistics under Condition C). Un- 
der Condition C, Conditions R holds with the limit Gaussian process stated in equation 
l[2. 7| ). The plug-in estimates of the weighting functions ^2.12^) and /12.13\) are uniformly 
consistent for the weighting functions 1^2. 10\) and ^2.11\) . Therefore, conclusions of The- 
orem 1 hold with the limit laws for our statistics given by the laws of random variables 
stated in equations l{2.8\) and l{2.9\) . 
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2.3. Basic Validity and Convergence Rates for the Confidence Regions. We 

will first give a basic validity result for confidence regions assuming that we have suit- 
able estimates of the quantiles of LR and W statistics and will verify basic validity of 
our confidence regions. A basic procedure for constructing suitable estimates of these 
quantiles via bootstrap or simulation will be given below. 

Theorem 2. (Basic Inferential Validity of Confidence Regions). Suppose that 
for a < 1/2 we have consistent estimates of quantiles of limit statistics W and L, 
namely, 

^(l-Q;) = A;(l-Q;) + Op(l), (2.14) 

where k{\ — a) is the (1 — a)-quantile of either W or L, respectively. Then as the 
sample size n grows to infinity, confidence regions Rlr and Rw cover ©o with probability 
approaching 1 — a: 

P[©o C Rlr] = P[Cn < Ml - «)] ^ Pi'^ < ^(1 - «)] = (1 - «), (2.15) 
P[©o Q Rw] = P[Wn < Ml - a)] F[W < k{l - a)] = (1 - a). (2.16) 

We next recall that given a Euclidian metric d{-,-) on R^, the (symmetric) Hausdorff 
distance between two non-empty sets A,B gM.^ is defined as 

dniA, B) := max \ sup d{a, B), sup(6, A) \ . (2-17) 

Our next result shows that the confidence regions based on the LR and the Wald statistic 
are also root-n consistent estimators for the set ©o with respect to the Hausdorff distance. 
This result also establishes that the Pitman rates for Ryy and R^^ is also root-n, namely 
any alternative set ©^ such that ^^(///(©o, ©a) oo will not be covered by Rw and 
Rlr, and hence will be rejected with probability tending to 1. 

Theorem 3. ( Confidence Regions are ^/n- Consistent Estimator ofQo). Under 
Condition R, the confidence regions based on the LR and Wald statistics are consistent 
with respect to the Hausdorjf distance at a root-n rate, that is 

dniRLR: ©o) = Op{n-'/^) and dn^Rw, ©o) = Op{n-^/^). 

As a consequence, if y/ndni^oT^A) oo, then 

P[©A C Rlr] and P[Qa C Rw] ^ 0. 
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While the results in Theorems [T]|3] were stated in terms of the high-level Condition R, 
the next corollary shows that these results can be applied to any problem where Condi- 
tion C holds. These conditions hold in HJ, MF, and other problems listed in Section 1, 
as we verify below. 

Corollary 2. (Limit Laws of LR and W statistics under Condition C). Sup- 
pose that Condition C holds and that consistent estimates of quantiles of statistics ^KEj 
and ^2. 9^ are available. Then the conclusions of Theorem 2 apply. 

We now turn to estimation of quantiles for the LR and W statistics using bootstrap, 
simulation, and other resampling schemes under general conditions. The basic idea is 
as follows: First, let us take any procedure that consistently estimates the law of our 
basic Gaussian process G or a weighted version of this process appearing in the limit 
expressions. Next, we can use the estimated law to obtain consistent estimates of the 
laws of the LR and W statistics, and thus also obtain consistent estimates of their 
quantiles. It is well known that there are many procedures for accomplishing the first 
step, including such common schemes as the bootstrap, simulation, and subsampling, 
which can also be adapted to allow for various forms of cross-section and time series 
dependence. 

In what follows, we will simplify the notation by writing our limit statistics as a special 
case of the following statistic: 

s := sup [v{e)]l, v{e) := T{e)G{e). (2.18) 

Thus, 5 = £ for T{e) = l/s{e) and 5 = W for r(^) = l/[\\Vem{e)\\-w{e)]. We take r to 
be a continuous function bounded away from zero on the parameter space. We also need 
to introduce the following notations and concepts. Our process is a random element 
that takes values in the metric space of continuous functions C(6) equipped with the 
uniform metric. The underlying measure space is {Q, J-") and we denote the law of V 
under the probability measure P by the symbol Qy- 

In the following we will assume that we have an estimate Qv* of the law Qv of the 
Gaussian process V. This estimate Qv* is a probability measure which can be generated 
as follows: Let us fix another measure space {Q',J^') and a probability measure P* on 
this space. Then given a random element V* on this space taking values in C(6), 
we denote its law under P* by Qv*- We thus identify the probability measure P* 
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with a data-generating process by which we generate draws or reahzations of V*. This 
identification allows us to cover such methods of producing realizations of V* as the 
bootstrap, subsampling, or other simulation approaches. 

We require that the estimate Qv* be consistent for Qv in any metric px metrizing 
weak convergence, where we can take the metric to be the Kantarovich-Rubinstein met- 
ric. Note that there are many results that verify this basic consistency condition for 
different processes V and various bootstrap, simulation, and subsampling schemes, as 
we will discuss in more detail below. 

To define the Kantarovich-Rubinstein metric, let 9 ^ v{9) be an element of a metric 
space (M, c^m), and BLi(M) be a class of Lipschitz functions : M — )■ M that satisfy: 

\ip{v) - ip{v')\ < dM{v,v'), sup |v3(t;)| < 1. 

The Kantarovich-Rubinstein distance between probability laws Qv and Qy of random 
elements V and V taking values in M is defined as: 

PKiQ,Q';M):= sup \EQ^{V)-EQ,^iV)\. 

(^GBLi(Af) 

As stated earlier, we require the estimate Qv* to be consistent for Qy in the metric px, 
that is 

PKiQv*,Qv;CiQ))=Opil). (2.19) 

Let Qs denote the probability law of 5 = W or £, which is in turn induced by the law 
Qv of the Gaussian process V. We need to define the estimate Qs* of this law. First, 
we define the following plug-in estimate of the boundary set dOo, 

dQ^ = {O^Q, fh{9) = 0}. (2.20) 
This estimate t urns out to be consistent at the usual root-n rate, by an argument similar 



to that given in IChernozhukov et al.l ( 120071 ). Next, define Qs* as the law of the random 
variable 

S* = sup [V*{9)]1. (2.21) 

In this definition, we hold the hatted quantities fixed, and the only random element is 
V* that is drawn according to the law Qy*. 

We will show that the estimated law Qs* is consistent for Qs in the sense that 

PK{Qs*,Qs;^) = Op{l). (2.22) 
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Consistency in the Kantarovich-Rubinstein metric in turn implies consistency of the esti- 
mates of the distribution function at continuity points, which in turn imphes consistency 
of the estimates of the quantile function. 

Equipped with the notations introduced above we can now state our result. 



Theorem 4. (Consistent Estimation of Critical Values k{l ~ a)). Suppose 
Conditions R.1-R.3 hold, and that we have a consistent estimate of the law of our limit 
Gaussian processes V . Then the estimates of the laws of the limit statistics 5 = W 
or L defined above are consistent. As a consequence, we have that the estimates of the 
quantiles are consistent in the sense of equation 1^2. 

It is useful to give a similar result under more primitive conditions C.1-C.2. Recall 
that in this case our estimator satisfies 

v^(7 - 7) ^ 9}''^Z, 

so that our limit statistics take the form: 

s = sup [v{e)]l, v{e) = t{eyz, 

where t{6) is a vector valued weight function, in particular, we have 

t{e) = {V^m{e, 7)'fii/2)/s(^) for S = C, 

t{d) = (V-,m(^,7)'l]i/2)/(||Vem(e,7)|| ■ w{9)) for S = W. 

Here we shall assume that we have a consistent estimate Qz* of the law Qz of Z, in the 
sense that, 

PKiQz',Qz) = Opil). (2.23) 

For instance we can simulate the distribution using draws Z* ~ A^(0, /) or apply any 
valid bootsrap method to r2 ~^/^-^/n('7— 7). Alternatively it is possible to use subsampling 
(jPolitis and Romanol ( Il994j )). Then the estimate Qy of the law Qy* is defined as: 

v*{e) = t{eyz\ (2.24) 

where t{6) is a vector valued weighting function that is uniformly consistent for the 
weighting function t{6). In particular, we can use 

t{e) = (V^m(^, ^yQ^/^)/s{e) for S = C, 

t{e) = (V^m(^,7)'fii/2)/(||Vem(^,7)|| ■ for S = W. 



20 



In this definition we hold the hatted quantity fixed, and the only random element 
being resampled or simulated is Z*, with the law denoted as Qz*- Then, we define the 
random variable 

s* = sup [v\e)]l, 

and use its law Qs* to estimate the law Qs- In the definition of the law Qs* we fix, i.e. 
condition on, the estimated quantities t{-) and dQo. 

We can now state the following corollary, which is proven in the appendix: 



Corollary 3. (Consistent Estimation of Critical Values under Condition C). 

Suppose that conditions C holds, and that we have a consistent estimate of the law of Z , 
so equation 1^2. 23\) holds. Then this provides us with a consistent estimate of the law of 
our limit Gaussian process G, and hence all of the conclusions of Theorem 3 hold. 

2.4. Verification of Condition C for MF- and HJ-Bounds. We can now apply 
this result to the HJ and MF problems. As before, let v := E[r(_(_i] and S := E[(rt_|_i — 
v){rt+i — v)'], and the parameters 9 = {^,ay, and 'Jhj = {Syy, S^i, Sn)' for the HJ 
bounds, where 

= v'T.^'^v, Syi = I'jyT.^'^v, and = l'j^T.~^l]sf. 

Also let denote the sample analog obtained by replacing v and S with the mean v 
and variance S of the sample ri, . . . ,rr. For the MF bounds, let Jmf = vec(D'SD), 
where 

B := Covt{Zt+i,rt+i) and D := [v,B'] 

and 'Jmf be the sample analog, and let /i* and (3* be the target mean portfolio return 
and factor loadings. 

Recall that the Hansen- Jagannathan mean-variance bound for the stochastic discount 
factor is defined by the moment condition 

mHj{0, jHj) = a/ S^yfj,"^ - 2Syifi + 5*11 - a. 

whereas the multi-factor efficient (MF) mean-variance set for portfolio returns is char- 
acterized by 

rriMFie, Imp) := ^/{tF^FW^W\t^^*l^' - ^ 
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We now give regularity conditions for the validity of inference and confidence intervals 
based on the LR and Wald statistics derived from the moment functions mHj{0) and 
fnMFid), respectively. 

Proposition 1 (Verification of Condition C for HJ and MF problems). Suppose 
that V, Ti, and B satisfy a CLT. Furthermore, let Q be a rectangle inWx M+, that the 
absolute values of the elements of v and eigenvalues of S are bounded between finite 
strictly positive constants, and that N^^^^vf — ^Xlili"^*) bounded away from zero. 
Then Condition C holds for the HJ and MF hounds. 

For the last condition, note that for the purposes of this paper, we treat the number 
of assets as finite, and that the Cauchy-Schwarz inequality implies that the difference 
^YliLi'^i ~ {^iLi^i^ ^ 0- The difference will be strictly positive only if the mean 
return Vi is not constant across assets. 

2.5. Structured Projection Approach. We also compare our procedure to alterna- 
tive confidence regions from a structured projection approach that is based on a con- 
fidence set for the point-identified parameter 7 that characterizes the bound m{6) = 
m{6,'-f) under the condition C. This confidence region is then "projected" to obtain 
a confidence region for Gq. We call the projection approach "structured" when 7 is 
a minimal sufficient parameter for the bounds on 6, and 7 is the minimal sufficient 
statistics. For example the HJ bounds from Example 1 can be characterized by the 
three-dimensional parameter 

7 = ('S'ot, Syl, S'li)', 

whereas the Markowitz-Fama mean- variance frontiers are described in terms of the first 
two moments of the spanning tangency portfolios. Note that the structured approach 
avoids creating some confidence regions for high-dimensional mean and variance param- 
eters V and S and then projecting them to obtain a confidence region for Bq. Such an 
approach would be extremely conservative, and working with the minimal parameter 
7 instead reduces the conservativeness dramatically. Since inference for 7 is standard, 
the structured projection approach seems much more natural and "economically ap- 
pealing". However, despite its dimension-reducing and intuitive appeal, this approach 
remains very conservative and is much less powerful than the approach based on the 
optimally weighted LR-type and W-statistics. We illustrate the superior performance 
of LR-based confidence sets compared to the structured projection approach empirically 
for the Hansen- Jagannathan set (see Figure [7]). 
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For the construction of projection confidence sets, we assume that we can construct a 
(1 — a) confidence set for 70. This confidence region can then be projected onto the 
parameter space B to form the confidence set for Oq: 



Rproj:= U {9ee:m{e,j)<0}. 

76-R7 



(2.25) 



Under condition C, v^(7 ~ 7o) satisfies a CLT, so we can construct an elhptical joint 
1 — a confidence region for the quantity 7 as follows: 

i?^ = e r : n(7 - 7)'^^''(7 - 7) < c(l - «)}, (2.26) 

where c'(l — a) is either 1 — a-quantile of x^(^) variable, or any consistent estimate of 
such a quantile. Under asymptotic normality of 7, this construction approximates an 
upper contour set of the density of the estimator, and therefore gives an (approximate) 
smallest-volume confidence set for the parameter 7. 

Proposition 2. (Basic Validity of Structured Projection Approach). Let R, 

be 1 — a confidence set for the parameter 7. Then P[0o C Rproj] > P[7o ^ R-y] = - 
In particular, under Condition C, the region R^ in Ii2.26\) obeys 

P[7oGr] = l-« + o(l), 

so that the confidence region l{2.25\) based on structured projection obeys 

P[0o C Rproj] > l-a + o(l). 



'7 

a. 



Projected confidence sets were first proposed by IScheffa (119531 ) for the problem of 



joint confidence bounds for all linear combinations of the form c'7, where 7 G M is a 
parameter vector and c G S^^^, the d—1 dimensional unit sphere in R'^. For the problem 
of confidence ba nds for the linear regression function, it has been shown that optimality 
of lScheffg (119531 ) 's method depends crucially on equivariance with respect to translations 



and orthogonal transfo r matio ns of the original p aramete r , see sections 9.4 and 9.5 in 



Lehmann and Romano! (120051 ) for a discussion. 



Bo: 



irerl (119731 ) showed optimality of 



projection bounds for a generalization of lScheffg (119531 ) 's original problem, but they were 
shown to be suboptimal if coverage was only required for a r estricted set of functionals 



see 



(1980 


) and 


Naiman 


(1984) 



Casella and Strawderman 



In our case, there are at least three regards in which equivariance with respect to the 
reduced- form parameters 7 fails: in all our examples, (1) the dimension of 6 is strictly 
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lower than that of 7, (2) we consider inference problems that are one-sided rather than 
symmetric where the parameter space is only a compact subset of M'^, and (3) the inequal- 
ity generating function m{6, 7) is nonlinear in 7. Even after approximate linearization, 
our inference problem reduces to inference on c'7 with c restricted to a small subset 
of the sphere which is exactly the case where the structured projection approach 

becomes suboptimal. As mentioned above, we illustrate the superior performance of 
LR-based confidence set compared to the structured projection based set empirically for 
the HJ problem (see Figure [7]) - the LR-based set is much smaller and lies strictly inside 
the projection-based set. 



2.6. Invariance and Similarity Properties of Confidence Regions based on LR 
and Wald Statistics. We next proceed to state the invariance properties of the pro- 
posed inference procedures with respect to parameter transformations. We distinguish 
between an exact invariance and an asymptotic invariance. A parameter transformation 
is a one-to-one mapping 17 : 6 — )■ T, where T = 77(6), and we denote the population 
and sample moment conditions for the transformed problem by m^(-) := m{ri'^{-)), and 
m^(-) := m(?7~^(-)), respectively, which are mappings from T — )■ M. We define H* as the 
set of parameter transformations r] that are continuously different iable in 6' G 0*^ and 
such that Conditions R.1-R.3 hold for the transformed moments m^(-). 

In this section we discuss invariance properties of inference based on the LR and W 
statistics. Let 

Cn{e) := [V^m^{e)/s{e)]l and >V„(^) := (v^rf(^, eo,5)M^))' , 

and let the critical values — and kw{l — a) be consistent estimators of the 
asymptotic 1 — a quantiles of £„ = supgg0g CniO) and >V„ = supg^Q^^ Wn{0), respectively. 
Consider the decision functions for including a value of 6 in the confidence regions based 
on the LR and Wald-type statistics, respectively, 

(j)Ln{0) := HCnie) < Ml - a)} and <Pwn{0) := < W(l - ")}, 

respectively. Similarly, given the parameter transformation rj, we consider the statistics 

Cn{e;7]) := [^mM^))/s{e)]l and W„(^;r/) := (v^d(77(e), r/(eo,5))M^; r/))' 
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with a weight function w{9; rj) possibly depending on i], and the resulting decision func- 
tions 

cPLn{0;v)-=HJ^n{0;v)<kLrj{l-a)} and cPwn{0;v)-=H'^n{0;v)<kwvi^-a)}, 

where the critical values — a) and kwn{i — a) are estimates of the asymptotic 1 — a 
quantiles of Cniv) = ^^PeedOo ^n{0] v) and Wniv) = supg^QQ^ W„{9; rj), respectively. 
We say that the decision function (l)n{0;r]) is 

• invariant if for any ^ e 0, we have 0n(^) = 0n(^; v) fo'^ parameter transfor- 
mation rj e H*, 

• asymptotically invariant to first order if for any parameter transformation 77 e 
H*, any 9o e ©, and any sequence 9n — >■ 9, we have P {(t)n{9n) 7^ (t>n{9n', rj)) — >■ 
as n — > 00. 

These invariance properties describe whether the parameter transformations affect the 
inclusion of any sequence of points ^„ in the confidence sets. Note in particular that 
(exact) invariance is a property of a given realization of ff) ^"^^ implies asymptotic 
invariance. Furthermore, it is easy to verify that invariance of a decision function <pn{9] rj) 
implies analogous equivariancc properties for the corresponding confidence sets 

RLR = {9eQ: (t>Ln{9; v) - 1} and Rw = {9 e Q : cl>wn{9; v) - 1}- 

We also would like to mention another property, which characterizes the precision of 
the confidence sets. For a given value of 77, we also say that a test based on (f)n{9; rj) is 
asymptotically similar on the boundary of ©0 if lim„ E[0„(^; 77)] = c for some constant 
c e (0, 1) and any 9 e 9©o- Similarity here means that any point on the boundary of 
the identified set can be expected to be included with asymptotic probability c, which 
does not vary with the location of the point. 

Given these definitions, we can now characterize the invariance and similarity prop- 
erties of inference procedures based on the weighted LR and Wald statistics: 

Proposition 3. (Invariance Properties of Decision Functions based on Cn{9; t]) 
and yVn{9;ri)). Suppose Conditions R.1-R.3 hold. Then (i) the decision function 
4>Ln{9',ri) based on Cn{9;ri) is invariant, and asymptotically similar on the boundary of 
©0, whereas (ii) the decision function (l)wn{9',ri) based on yVn{6','n) o,symptotically in- 
variant to first order if the weighting function is of the form w{9; rj) = m\ii(e))\\ 
for some function h{9) > that does not depend on rj and all 9 e dQ. Furthermore, (Hi) 
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ifw{9; rf) = m%(e))\\ ~^ '^p(^) /^'^ some constant b > 0, then (j)wn{(^) is asymptotically 
similar on the boundary of Qq. 

Notice in particular the different roles the norm of the gradient of m{6) and the stan- 
dard deviation s{6) play for the properties of the weighted Wald statistic: Choosing 
weights that are inversely proportional to ||V6i«^(^)|| in the limit corrects for the depen- 
dence of the Hausdorff distance on the parameterization of the problem, and accounting 
for s{6) also gives similarity on the boundary. In particular, only the weights in part 
(iii) of Proposition |3] yield results for confidence sets based on the Wald statistic that 
compare to the performance of LR-based inference. Our empirical results below illus- 
trate that the difference is important, since the lack of invariance or precision can lead 
to overturning the main economic conclusions in the empirical analysis. 



3. Empirical Applications 

3.1. Hansen- Jagannathan Mean- Variance Sets for the SDF. In order to illus- 
trate the performance of our procedure, we estimate confidence sets for the Hansen- 
Jagannathan sets of me an-variances of stochastic discou nt factors. In order to keep 



results comparable with iHansen and Jagannathan! (119911). we construct the sarnple fo r 



the empirical exercise following the data description in [Hansen and JagannathanI ( 1l99ll ). 
The two asset series used are annual treasury bond returns and annual NYSE value- 
weighted dividend included returns. These nominal returns are converted to real returns 
by using the implicit price deflat or based on personal consumption expenditures used by 
Hansen and Jagannathan Jl99ll ). Asset returns are from CRSP, and the implicit price 



deflator is available from St. Louis Fed and based on National Income and Product 
Accounts of United States. We use data for the years 1959-2006. 

Figure [T] reports the estimated bound consisting of the mean-standard deviation pairs 
which satisfy 

m(e,7) = 0, 

where 7 is estimated using sample moments. 

We can compare the estimated HJ bounds with mean-variance combinations implied 
by the consumption CAPM model. In the model the economy is equivalent to a repre- 
sentative agent with constant elasticity of intertemporal substitution preferences 
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where Ct is the aggregate consumption. Then the stochastic discount factor imphed by 
consumption growth is given by Mt{g) = (3 (^c^) ; so that we can estimate the first two 
moments of Mt{g). Specifically, let Hc{q) '■= ^[Mt{Q)\ and oc{q) '■= y^Var (Mt(^)) be 
the mean and standard deviation, respectively, of the stochastic discount factor given an 
iso-elastic utility function with an elasticity of intertemporal substitution equal to 1/ q. 
We can characterize the feasible set Gg of mean- variance pairs by the moment restriction 

= m((/i, a)) := /ic7((Tc^(^)) - 

where cr^^(-) denotes the inverse function of crc(f))|§ Given an i.i.d. sample of observa- 
tions for the growth rate of consumption we define the empirical analog fh{{fi, a)) 
analogously, where the expectations in the definition of yUc(') and crc(-) are replace 
with averages. The mean- variance pairs reported in Figure [T] were obtained using data 
on per capita expenditures for non-durable consumption and services in the U.S. from 
1959-2006, assuming a discount factor /3 = 0.95. 

It is well known that it is difficult to reconcile asset prices and aggregate consumption 
empirically in a representative framework. For our data, the values of g corresponding to 
mean- variance pairs for the SDF that fall inside the estimated HJ set range from about 
170 to 192, suggesting a very low elasticity of intertemporal substitution and implying 
a high variance for the SDF. These values for g are unrealistically large, but are in line 
with other findings in the empirical literature on the consumption based CAPm|^ 

In order to represent the sampling uncertainty in estimating 7, we plot 100 bootstrap 
draws of the HJ frontier in Figure [21 where observations were drawn with replacement 
from the bivariate time series of stock and bond returns. In order to represent the sam- 
phng uncertainty in estimating the mean-standard deviation pairs {fic{g), crcio)) of the 
consumption-based SDF, we also plot 100 bootstrap draws of g {pc{.Q)^^c{.Q))- We 
see that the sampling uncertainty is quite considerable for both HJ frontier and for mean- 
standard deviation pairs implied by the consumption-based SDF. In fact, an intriguing 
feature of this graph is that near the apex of the HJ frontier, the pair {j1c{.Q)^^c{.q)) 
and its bootsrap draws are close to the HJ frontier at low values of g. The low values of 
g are considered to be "reasonable," since they correspond to a relatively high intertem- 
poral elasticity of substitution and appear to be well micro-founded . The inability of 



■^It can be verified that acio) is strict ly increasing in p, so that this inverse is well-defined. 
^E.g. in Lettau and LudvigsonI ( 2009[) . the values for the elasticity of intertemporal substitution mini- 
mizing the mean squared error in the Euler equations characterizing the household's investment problem 
are comparable in magnitude. 
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"reasonable" values of q to reconcile aggregate consumption data with asset prices has 
been a major theme of the empirical l iterat ure on the consumption-based CAPM model 
starting with iHansen and Singleton! ( 1l982l ). However from Figures 1 and 2 it is not 



obvious that an empirical test will reject the canonical/baseline model underlying that 
literature, and we report confidence sets based on the various approaches discussed in 
earlier sections based on which we can make inferential statements about the benchmark 
model. 

Figure [3] shows the 95% confidence region based on the LR statistic. By construction, 
the LR confidence region covers most of the bootstrap draws below the HJ bounds. 
However, it should also be noted that the confidence bound based on the LR statistic is 
fairly tight relative to the boostrapped frontiers, and does not include any unnecessary 
areas of the parameter space. Noting that the set 6e for the consumption-based SDF is 
defined by a moment equality, we can also form a confidence band i?e for ©e based on the 

r ^ -i 2 

statistic Cn{9) := "^^^q^^^ ■, noting that the asymptotic arguments in the derivations 
for the (one-sided) LR statistic can be easily extended to the two-sided case if we replace 
squared positive parts [-J^ with the usual square, (■)^. The lower and upper bounds in 
the following figures were constructed using separate estimates of the local standard de- 
viation a)) based on the negative and positive deviations of m((/i, cr)), respectively 
to improve the approximation. Critical values were obtained using the nonparametric 
bootstrap. 

Most importantly, the LR-based confidence region for the HJ set does not overlap 
with the confidence set for the consumption-based SDF for "small" values of q (in fact, 
for any q G [0, 120]). The absence of overlap for the 95% regions implies the rejection of 
any q G [0, 120] at 10% significance level. This is clear evidence against the benchmark 
formulation of the consumption-based CAPM, and therefore an important empirical 
conclusion. In what follows below we will show that the same empirical conclusion cannot 
be reached for this example using less precise or non-invariant methods. Specifically, we 
will show that if we use confidence regions based on either LR-statistic without precision 
weighting, or Wald statistics without invariance/precision weighting, or regions based 
on structural projection, we will not be able to reach the same empirical conclusion. So 
invariance and precision considerations in construction of the confidence regions turn 
out to be quite important for reaching sharp economic conclusions. 
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Figure H] plots the 95% confidence region based on an unweighted LR statistic. Com- 
paring Figure Eland Figure H] it can be seen that precision weighting plays a very impor- 
tant role in delivering good confidence sets. Without precision weighting, the unweighted 
LR statistic delivers a confidence region that includes implausible regions in the param- 
eter space where the standard deviation of the discount factor is zero. Moreover, the 
confidence region becomes too imprecise to reject the canonical model of the stochastic 
discount factor. 

Figure [6] plots the confidence region based on the Wald statistic with no invari- 
ance/precision weighting, which is equivalent to a confidence region based on the di- 
rected Hausdorff distance. Similar to Figure H] the confidence set covers a large area of 
the parameter space which is excluded from any bootstrap realization of the HJ set. The 
shape of the confidence region based on the Wald statistic in Figure [6] seem counter- 
intuitive because at first sight, as the confidence bounds do not appear to be a uniform 
enlargement of the estimated frontier dQ. However, this visual impression is only due 
to the fact that the plot shows units of fi and a at different scales. The observation 
that the weighting and scaling of the different components of 6 seem "unnatural" in this 
particular graph emphasizes the potential problems associated with the non-invariance 
of inference based on the unweighted Wald statistic. 

Figure [5] plots the confidence region based on the weighted Wald statistic, where 
weights induce first order invariance and similarity via precision weighting. This weight- 
ing fixes the problem and generates a statistic that is (first-order) invariant to parameter 
transformations. As a result, the confidence set looks very similar to weighted LR based 
confidence set in Figure [3] in that it covers most of the bootstrap draws below the HJ 
bounds and its shape refiects local sampling uncertainty in an adequate manner. This 
practical evidence therefore emphasizes the importance of introducing invariance and 
precision inducing weights in the Wald-based approach, which we had argued for theo- 
retically in the previous sections. 

Finally, in Figure 7 we compare our results to confidence regions from the structured 
projection approach that is based on a confidence set for the point-identified parameters. 
As described in section 2.5, we construct an elliptical joint 1 — a confidence region for 
the quantity 7 defined in equation (11. 2p based on the quadratic form for the estimator 
7, T(7) := (7 — 7)Var(7)~^(7 — 7). For the diameter of this confidence ellipsoid we used 
both a bootstrap and a chi-square approximation to the distribution of ^(7), which both 
yield qualitatively similar results. 
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The 1 — a confidence set for 6 = {fi, a) is obtained by projecting tlie 1 — a confidence 
region for 7 onto G using tlie condition 171(6,'-/) < 0. We report tlie resulting confidence 
set from tlie structured projection approacli in Figure [7] togetlier witli tlie LR-based 
confidence set proposed in this paper. The structured projection confidence set performs 
quite poorly relative to the LR-based confidence set: in particular the latter is much 
smaller and lies strictly inside the former. In fact, the precision of the confidence set 
based on structured projection is poor enough to overturn the major empirical conclusion 
that the consumption-based CAPM cannot be reconciled with small values of g. 

This should be expected since the projection confidence bounds are based on a confi- 
dence set for the point- identified parameter that does not account for the specific shape 
of the bounds as a function of 7. More specifically, the elliptical joint confidence set 
for 7 (which minimizes volume under joint normality of 7) guards us against deviations 
from the true value in any direction in M^, but most of these deviations are irrelevant 
for the bounds for 6, since these are only one-sided and the parameter space for 6 is 
only two-dimensional. The fact that the standard confidence set for 7 treats all direc- 
tions in the parameter space F symmetrically may be far from ideal for inference on the 
(/i, cr)-frontier, since the bound on the standard deviation is a nonlinear function whose 
derivative with respect to 7 varies widely across different values of (/x,cr(/i)). Note that 
for confidence sets for a point-identified parameter, by the delta method the effect of 
nonlinearities is asymptotically negligible to first order. However when the object of 
interest is a set with a nontrivial diameter, the resulting effect is of first order even for 
large samples. 



3.2. Bounds on the Elasticity of Labor Supply. In his meta-analysis, IChettyl (I2OI2I ) 

reports point-wise confidence bounds for the structural Hicksian elasticity e of labor 
supply at the intensive margin for given values of the optimization friction 6. The 
reported bounds result from the intersection of bounds of the form (11.41) from estimates 
£j obtained from J empirical studies studies exploiting different natural experiments 
varying the effective income tax Aj logp, j = 1, . . . , J. 

We apply the bootstrap procedure proposed in this paper to obtain joint confidence 
sets for [6, e) based on the LR and Wald statistics. More specifically, we consider the 
moments obtained from individual empirical elasticities 



goF,j{{e,6y) = goF,j{{e,6y,ej) := ^ — k^j^^sfj _ ^_ 



^2(A,logp)2 
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moi.(£, 5; A) = E,=i7^j 7TX — ; — —9oF,j{,e, 5), 



These "raw" moments are then aggregated by a smooth function 

exp(A^OF,j(g,(5)) 
Ef=iexp(A^OF,z(e,5))" 
where A G M is a fixed, positive scalar. Note that as discussed in Section 2, this 
transformation approximates the maximum of ^of,i(£^, 5), • • • , ?of,j(£, 5) as A — )■ oo, 
but satisfies the smoothness conditions for our procedure for any finite value of A > 0. 

We use a parametric bootstrap to obtain the critical value k{l — a), where we approx- 
imate the sampling distribution of the estimators for the respective elasticities by a joint 
normal distribution centered around the estimates reported in Panel A of table 1 with 
standard deviations equal to the respective standard errors and zero covariances. This 
approach can be justified by an assumption that the studies were based on mutually 
independent random samples from possibly different populations. 

Figure p show s that the estimated bounds coincide with the set reported in Figure 8 
of lchettv (120121 ) except for the use of the smoothed maximum function instead of the 
intersection of (e, (5)-sets which leads to a shghtly wider set. The 95% confidence set 
based on the LR statistical reported in Figure [9] is fairly narrow around the estimated 
bound, and does n o t app ear to differ very much from the the collections of confidence 



intervals in 



Chettvl h.ni'k . Chetty presents confidence intervals that are pointwise with 



respect to 5, that is for each fixed value of friction 5, the interval covers structural elas- 
ticity e with a prescribed probability. In contrast, our set estimator covers all plausible 
values of (5, e) with a prescribed probability. Thus it simultaneously performs inference 
on both structural elasticity e and the friction amount 5. The LR confidence region is a 
valid joint confidence set for (e, 5) and not only point-wise in 5. Furthermore, it is not 
conservative in that we assume a joint sampling distribution for the elasticity estimates 
instead of constructing Frechet-Hoeffding bounds. The LR confidence region excludes 
all points with 5 < 0.3%, so that an optimization friction of at least that size would 
be needed to reconcile the different elasticities found in the studies considered in this 
meta-analysis. 



Note that for the LR statistic we used the standard error of the negative part of fh* [9) as a weighting 
function which improves the local approximation due to the asymmetry of the distribution for small 
values of q. Note that for the A'^(0, s{9)'^) distribution, the standard deviation of the negative part is 
proportional to s{9), so that this weighting scheme is asymptotically equivalent to weighting by the 
(inverse of the) local standard deviation of fh*{9). 
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Finally, we also report a 95% confidence region based on the Wald statistic without 
optimal re- weightings As in the case of H J bounds, the shape of the resulting confidence 
set does not reflect the sampling variation in the estimated bounds, and the critical value 
for the Wald statistic is determined by perturbations of the frontier at very low values 
for e. More importantly, in contrast to the LR-based region, the confidence set based 
on the Wald statistic includes points with 6 = 0, failing to reject that the empirical 
elasticities can be reconciled in a model with no optimization frictions and changing one 
of the main conclusions of the analysis. Using weighted W statistics instead fixes this 
problem and gives a confidence set that is very similar to the LR-based confidence set; 
we do not report this confidence set for brevity. 

4. Conclusion 

In this paper we provide new methods for inference on parameter sets and frontiers 
that can be characterized by a smooth nonlinear inequality. The proposed procedures 
are straightforward to implement computationally and have favorable statistical prop- 
erties. By analyzing the geometric and statistical properties of different statistics, we 
illustrate the importance of equivariance and similarity considerations for achieving tight 
confidence regions. In particular, while local weighting is irrelevant for the statistical 
properties of the estimated frontier, it matters greatly for the size and shape of confidence 
sets. We also consider smoothed intersection bounds from multiple inequality restric- 
tions, where we give an exact upper bound for the approximation error that depends 
only on the smoothing parameter. 

We illustrate the practical usefulness of these procedures in financial econometrics with 
various classical examples from mean-variance analysis, including inference on Hansen- 
Jagannathan mean-variance sets of admissible stochastic discount factors, Markowitz- 
Fama mean- variance sets of ad missible portfo lios, and factor-based asset pricing. As a 



second application, we consider IChettyl (l2012[ l's joint bounds for the elasticity of labor 



supply and an optimization friction. This example suggests a broader range of uses for 
set inference in the context of possibly misspecified or incomplete economic models. 

In both examples, using invariant or precision-weighted statistics is important for 
maintaining major empirical conclusions that have been reached informally in prior 



6 



In order to adjust for the differences in order of magnitude we constructed the Hausdorff-distance 
based on the norm (5)|| = _|_ iqq . §2_ Note that in the graph the confidence region looks poorly 
centered around the estimated bound, but this optical impression is in fact due to the different scaling 
of the two axis and the difference in the slope of the frontier above and below its apex. 
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empirical work, e.g. the inability of large values of the elasticity of intertemporal substi- 
tution to generate plausible distributions of stochastic discount factors, or the need for 
nontrivial optimization frictions to reconcile estimated demand elasticities from different 
settings. Therefore, the empirical examples illustrate our formal points about the ad- 
vantages of inference based on a precision weighted metric that is invariant to parameter 
transformations. 
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Appendix A. Proofs 

Proof of Lemma [H W.l.o.g., let maxj gj = gi and rewrite 

, / , , A E/=i exp{A(^j- - ^i)}A(^i - gj) 

Xlmaxgj -m{gi,...,gj;X) \ = — ^ — 

V J / >.,=i expjA o,- - oi I 



Ei=iexp{A(^j -^i)} 
Y.Liew{-hj}hj 



Ej=iexp{-/ij} 

where hj := X{gi — gj) > 0. Clearly, this expression is nonnegative, and since hi = 0, 
the denominator is bounded from below by 1. Next note that the function 

l + Ej=2exp{-/ii} 

is strictly quasi-concave on M::[~^, so that the usual first-order conditions for a local 
extremum are sufficient for a global maximum. We can now verify that the first-order 
conditions for maximization of F{h2, . . . , hj) have the symmetric solution h2 = ■ ■ ■ = 
hj = h* := 1 + W* where W* := W (^). Note that by definition of the product 
logarithm, W* = ^ exp{-iy*} = ( J - 1) exp{-l - W*}, so that 

max . ^U^i-^}'^ ^jJ-DeM-l-pH + W*) 

h2,...,hj>o ^ ' i + ^^^^exp{-/i*} l + (J-l)exp{-l-iy*} 

W* + iJ-l)exp{-l-W*}W* ^^,^^fj-l 



1 + {J - 1) exp{-l - W*} 
Since A (max^ gj - m{gi, ...,gj; A)) = F{X{gi - 5(2), . . . , A(5fi - gj)), we therefore have 



that 



sup A 

gi,-,9j 



m&xgj - m{gi, ...,gj;X) 
3 



sup \F{h2,...,hj)\=W ^ 

h2,...,hj>0 



which establishes the conclusion. 



Next, we will prove four lemmas which will be used to justify the local approximation 
for the Wald statistic. We consider a (stochastic or deterministic) sequence of moment 
functions m„(6') := m{6) — Qn, where g„ = Op(l), and m{9) satisfies Conditions R.1-R.2 
from the main text, and the corresponding sequence of parameter sets B„ := {9 E Q : 
m„(^) < 0}. 
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Lemma 2. Suppose the parameter space © is compact. Suppose that the gradient 
VeTUniO) is hounded away from zero uniformly in 9 and n — 1,2,..., and Lipschitz- 
continuous in 9 with Lipschitz constant L < oo. Also let 9n be any sequence such that 
9n approaches the boundary i.e. d{9n,dQn) 0. Then there exists 5 > such that 
the projection of 9n on ©„ is unique for all such sequences whenever d{9n, ©„) < 

Proof: Suppose the statement wasn't true. Then for some sequence 9^ we could 
construct a subsequence 9k[n) such that there are (at least) two distinct projections of 
9s{n) onto dOn for each n. By compactness of ©, 9k{n) has a convergent sub-subsequence 
9b{n) with lim„^b(n) = ^o, say. Since by construction every member of 9b(n) has two 
distinct projections onto 9©„, we can inscribe a ball of radius r„ := d{9b(^n), 9©n) centered 
at ^b(n) into ©/©n such that this ball has at least two distinct points (^t ^(n)' ^2 6(n)) ™ 
common with 9©„. 

By properties of the projection, the radii of these balls corresponding to the projection 
points, fn^{^b{n) — ^jb{ny) 3 ~ 2' normal vectors to the surface 

dOn a-t ^Ibin) ^2 6(n)' respectively. Note that, since the gradient V6/m„(^) is bounded 
away from zero, we can w.l.o.g. normalize the length of the normal vectors of the surface 
dOn to 1. 

Note that the two points 91 j^^^^ , 9^ are equidistant to 9b(n) , and therefore lie on a 
one dimensional sphere 5'^(r„, ^6(n)) C © with center 9b{n) and radius r„. The curve r]{s) 
corresponding to a sphere of radius r, where s is the arc length, has constant curvature 
k{s) = so that the normal vectors (with length normalized to one) N[9*f^^^^) :— 
^n^^Kn) - 01^^^^) for j = 1, 2 satisfy 

Since r„ — ?> 0, there is no upper bound on r,7^, so that N{9) is not Lipschitz continuous. 
However, the normal vector of OQq at 9 standardized to length 1 is given by 

N{9) := \\Vem{9)\\~^Vem{9). 

Condition R.2 implies that N{9) is Lipschitz continuous in 9 with a constant 1/5 < oo, 
which leads to a contradiction. Finally, note that since that Lipschitz constant L was 
assumed to be uniform in 7, there exists 5 :— S{L) > such that projection is unique 
whenever r„ := d{9n, ©n) < 5 along all sequences 9n and m„(-) □ 
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Lemma 3. Suppose R.l and R.2 hold, and let On he a sequence such that d{6n,Qo) — 
0. Define On = argminm(6»)<o ll^n — ■ Then On is uniquely defined with probability 
approaching 1, and satisfies d{On, 6o) 0. 

Proof of Lemma [HI For a given value of 5 > 0, let rj := min{m(^^) : d{0, ©q) ^ 
By continuity of m{0) and compactness of 6, ?7 > 0. By Condition R.l supgg0jj m(6') < rj 
w.p.a.l, and therefore d{On, 6o) < ^ with probability approaching 1. Uniqueness follows 
from Lemma [21 noting that the conclusion holds regardless of whether the sequence 
mn{0) is deterministic or stochastic □ 

Lemma 4. Suppose Conditions R.l and R.2 hold, and consider a sequence of parameter 
values On ^ Q such that On — )• OQq- For a sequence g„ = Op(l) and 6„ := {6' G : 
rh{0) < Qn}, we have 

V^d{On, §0,5) = [\\V0m{On)\\-\Gn{On) + v^(m(^„) - + + 

In particular, for g„ = 0, we have 

V^d{0n,eo,5) = [\\Vem{0n)\\-\Gn{0n) + V^m{On)) + Op{l)]^ 

Proof of Lemma [31 Without loss of generality, we only consider the case g„ = 0. 
Note that for the general case, we can apply the same argument to the functions m{0)—qn 
and m{0) — qn which inherit the properties R.1-R.2. 

The projection of On onto the set 9o,5 := {0 ^ : m{0) < 0} is given by 

On '■= arg min \\0n — O'W"^. 

The Lagrangian for this constrained minimization problem is \\0n — O'W^ + 2Xm{0'), 
where the Kuhn- Tucker multiplier A > 0. Since 5 > is fixed and d{On, Oq) — it follows 
from Lemma [3] that On is contained in the interior of 0*^ with probability approaching 
one. Hence for n large enough, the quantity On can be taken to be an interior solution 
of the saddle-point problem 

iOn-On) + Vem{On)X = 0, 
m{On) = 0. 

We can now use a mean-value expansion to obtain 

iOn-On) + VemiOn)X = 0, 
m{On) + Vem{Ony{0n-On) = 0, 
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for an intermediate value On- By Lemma[3l Vemipn) = ^9m{6n) +Op(l) and Vefn{9n) = 
Vem{6n) + Op(l). Hence, solving for {9n — 6'„) and applying the extended continuous 
mapping theorem, we obtain 

On - On = -[Vem{On)iVem{On)Vem{On))-^ + Op(l)]m(^„) 

Now, since ^/n{m — m) = Gn = Op{l) in £°°(0'^), we obtain 

V^{0n-0n) = -Vem{0n)i\/emi0n)Vem{0n))-\Gni0n) + V^m{0n)) + Opil) 

Thus we have that 

V^d{On,Qi) = M\On-On\\l{m{0n)>O} 

= \i\/em{On)Vem{On))-'/\GniOn) + v^m(^„)) + Op(l)| (A.l) 
xl{GniOn) + V^m{On) > + Op{l)} (A.2) 
= [\\Vem{0n)\\-\Gn{0n) + V^m{On)) + Op(l)] + 

(A.3) 

which concludes the proof □ 



A.l. Proof of Theorem [U. Part 1. [Limit law of Cn-) Let G„ = y/n{m — m). We 
first show that 



Cn = sup [^/iim{0)/s{0)]l = sup [{Gn{0) + V^m{0))/s 



=(1) sup [{Gn{0) + V^m{0))/s{0) + Op{l)]l 

=(2) sup [iGniO) + V^m{0))/s{0) + Op{l)]l 

^(3) sup [iGiO) + V^m{0))/s{0)]l, 
ee90o 



where the steps follows from the following arguments. 
To show step (1), note that 

sup [{Gn{0) + V^m{0))/s{0)]l> sup [{Gn{0) + V^m{0)) /s{0)]l . 

So we need to show that with probability approaching 1, 

sup [{Gn{0) + V^m{0))/s{0)]l< sup [{Gn{0) + V^m{0)) /s{0) + Op{l)]' 



37 



To show this take a sequence 6n e ©o such that 

sup [(G„(^) + v^m{e))/s{e)]l = [{GniOn) + V^m{en))/s{en)]' 

eeeo 

where the supremum is attained by compactness of ©o and continuity hypotheses on 
m, m, and s, and uniform positivity of s. Since sup^^Q |Gn| — 0^(1), 9n can be selected 
to obey 

^^m(^„)/?(e„) = Op(l). 
We now show that On converges to dOo: fix > 0, and let 

C{k) := sup {m{e) /s{e) : 6 e Qq, d{e, dQo) > i^} ■ 

Next, note that Condition R.2 imphes that m{9) < for each 9 e int ©q: suppose this 
wasn't true, but that m{9o) — for some 9o e int ©o- Then, since for ^ G ©o m{9) < 0, 
9o represents a local maximum of m(-) in the interior of ©o- Since ■m{9) is differentiable, 
this imphes that Vem(^o) — 0. However, Condition R.2 implies that ||Vem(^)|| > 0, 
a contradiction. Therefore m{9) < for each 9 e int ©o, so that by continuity and 
compactness, ({k) < 0. 

By conditions R.l and R.3, we have sup^^gQ^ \ fh{9) /'s{9) — m{9)/s{9)\ < —( with prob- 
ability approaching one for any strictly negative value of C. Hence the supremum of 
^/n [rh{9) /'s{9)]'^ for values ^ e ©o such that d{9, dOo) > k is equal to zero with proba- 
bility approaching one, so that the supremum over all of ©o is attained at a value of 9 
such that d{9, dQo) < k. Since k can be chosen arbitrarily close to zero, we can construct 
a sequence of positive random variables Kn — Op{l) such that 

with probability approaching 1. We conclude that 
sup [{Gn{0) + V^m{9))/s{9)]l 

< sup [{Gn{d + v) + V^rn{9 + r]))/s{9 + r])]l. 

6>eaeo,6»+r)eeo,||??||<Kn 

Using stochastic equicontinuity of G„ implied by R.l, the last quantity is equal to 

sup [{Gn{9) + ^m{9 + r]))ls{9 + r^) + 0^(1)]' , 

6»ea0o,6»+r)e0o,||r)||<K„ 
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for some Op(l) term. Because ^JnmiQ + r/) < for 6' + r/ G 0o and m(6') = for 6* G (90o, 
we conclude that the last quantity is equal to 

sup [(G„(^^)/s(0 + r/) + Op(l)]^, 

for the same Op(l) term. This verifies equality (1). 

Equality (2) follows from using R.3 and that the fact that supg^Q |G„(6')| = Op(l) 
implied by R.l. Equality (3) follows from the application of R.l and the Continuous 
Mapping Theorem. 



Part 2. {Limit Law o/ W„). Recall that we define the set estimator Qq^s as 

00 5 := {eee^ : m{6) < O} . 
In analogy to part 1, we establish the conclusion by the following steps: 



^/Wn = sup ^d{e, Qo,s)/w{e) =(1) sup [{G^ + ^m{e))/w{e) + Op(i)] ^ 

=(2) sup [{Qn{d) + V^Hd))/w{e) + o^{i)\^ 

-w(3) sup [(<G{e) + ^m{e))/w{9)\ 

66900 

where steps (l)-(3) are proven as follows: 
To establish step (1), we first show that 

sup y/nd{e, ©0,5) /w{e) = sup y/nd{9, §0,5) /w{9) (A.4) 
holds with probability approaching 1, where 

e„ = {0 G 60 : d{e,d&o)/w{e) < k^, 

and Kn is some sequence of positive random variables converging to zero in probability, 
i^n = Op(l). Note that right hand side of flA.4p is less than or equal to the left hand side 
of flA.4l) by construction, so we only need to show that w.p.a.l, the right hand side can 
not be less. To this end, fix some k > and note that using the same line of reasoning 
as for part 1, we have 

C{k) := sup{m(^) : G Go, d{e, dQo) > k} < 0. 
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Furthermore, for every C < we have that with probabihty approaching 1, sup^ge \m{6) — 
m{6)\ < — C, and therefore each 6 G &o with d{6,d&()) > k is included in Qo,s- In that 
event, we can only have d{6, &o,s) > for values of 6 within a distance k of the boundary 
dQo- Since k can be chosen arbitrarily small, we can choose a sequence «;„ = o(l) such 
that the right-hand side in flA.4p holds with probability converging to one. 
Next, let 9n be a sequence such that 

sup y/nd{e,Qo^s)/w{9) = y/nd{en,Qo^s) /w{9n). 

Note that since d{6, 60,5) and w{9) are continuous functions of 9, w{6) is bounded away 
from zero with probability approaching 1, so that since Bq is compact, the supremum 
is attained with probability approaching 1. Since 5 > is fixed and by Lemma [3] with 
probability approaching 1, we have d{9n,0n)/w{0) < 6, so that 9n is contained in the 
interior of w.p.a.l. 

It follows from Lemma H] that 

V^d{e^,Qo,5) = [\\Vem{e^)\\-\GM + V^rn{0n)) + Op{l)]^ 

Hence, dividing by w{9) and using stochastic equicontinuity of G„(6'), we can follow the 
same line of reasoning as in part 1 to obtain that with probability approaching 1, 

_ \\Vem{9)\\w{9 + ri) 

Noting again that ^Jnm{9 + rf) <Q for ^ + r/ G Oq, we can bound the expression by 

\Vem{9)\\w{9 + T]) 

where again the Op(l) term is the same as in the first inequality. 

Therefore, we can use uniform convergence of the weighting function w to w from 
Condition R.3 and the continuous mapping theorem to obtain 

W„w sup [{\\Vemmw{9)y^^{0)\\ (A.5) 
where the intermediate steps are analogous to equalities (2) and (3) in the proof for £„. 



Part 3. [Continuity of the Li rnit Distribution s ). Th e continuity of the distribution 
function C on (0, 00) follows from Davydov et al. ( IQQsl ) and from the assumption that 



sup A/n(i(6', 6o,5)/w(6') < sup 

feGo ' 6»e9eo,e+»?eeo,h||<K„ 



sup y/nd{9,Qo^s) /u'{9) < sup 

eeeo ' 6»eaeo,e+r?eeo,hl|<K„ 



+ Op(l) 
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the covariance function of G is non-degenerate, i.e. inf^ge Var(G(^)) > 0. The proba- 
bihty that £ is greater than zero is equal to the probabihty that maxj supggg Gji^O) > 0, 
which is greater than the probabihty that Gjr{6') > for some fixed j' and 6', but 
the latter is equal to 1/2. Therefore the claim follows. The claim of continuity of the 
distribution function of W on (0, oo) follows similarly. □ 

A. 2. Proof of Corollary [1]. This corollary immediately follows from the assumed con- 
ditions and from the comments given in the main text preceding the statement of Corol- 
lary 1. □ 

A.3. Proof of Theorem [H We have that P[eo C R^r] = P[C„ < ^(1 - «)] by the 
construction of the confidence region. We then have that for any a < 1/2 that k{l — a) 
is a continuity point of the distribution function of C, so that for any sufficiently small e 

F[Cn < k{l - a)] < F[Cn < k{l - a) + e] + o(l) F[C < k{l - a) + e], 
F[Cn < k{l - a)] > F[Cn < k{l - a) - e] - o(l) ^ F[C < k{l -a)- e]. 

Since we can set e as small as we like and k{l — a) is a continuity point of the distribution 
function of £, we have that 

P[£„ < ^(1 - a)] F[C < k{l - a)] = (1 - a). 

We can conclude similarly for the W-statistic Wn- □. 

A. 4. Proof of Theorem [31 We will give the proof only for confidence regions based 
on the LR statistic. The arguments for the Wald-type confidence sets are completely 
analogous. Let 5„ be a null sequence where 5„ > for all n and ^/n6n — t- oo. In order to 
show convergence with respect to Hausdorff distance, we establish that with probability 
approaching 1, (a) Rlr C 6o", and (b) 6o C i?^"^, where for 6 > 0, := {x G M'' : 
d{x, A) < 6} denotes the 5-expansion of a set A in M^. 

To prove statement (a), consider a sequence On G 6\6o". We have to show that 
9n ^ Rlr w.p.a.l as n increases: For any fixed 6 > 0, let r] := infg.^(g 0jj)>jm(^). 
Since G is compact and m{6) is continuous by R.2, it follows from the definition of Gq 
that T] > 0. Hence, along any sequence 9n such that d{9n, Go) > S for all n, we have 
y/nm{6n) — oo. Since 9 G Rlr if and only if for the critical value k CTZn < fc, where k 
is tight for any n large enough, it follows that 9n 4- Rlr w.p.a.l. Hence we can restrict 
our attention to (sub-) sequences for which d{9n, Gq) — )■ 0. 
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Now suppose that d{9n, 60) — )■ 0, and y/nd{6n, 60) — 00. Let 6^* be the projection of 
On onto i96o, where 6'* need not converge to a particular point. Then using the same 
steps as in the proof of Lemma H] it follows from a mean- value expansion of m{9) around 
On and continuity of the gradient V6)m(6') that y/nm[dn) 00. Hence the LR-statistic 
diverges to infinity, and On 4- ^lr w.p.a.l. 

It remains to check claim (b), namely that Gq C -R/jj- To this end, consider a sequence 
On G Oo, and let g„ = 's{0)^ k{l — a)/n. Also define On = argminm(5i)<g^ \\0n — OW^, so 
that d{0n,Q>o,5) = \\0n — On\\- Now note that Lemma [3] implies that d{On,Qo) A 0, so 
that we can apply Lemma H] to obtain 

y^d{0n,0n) = [\\Vem{0n)\\-\Gn{0n) + V^{m{On) - qn)) + Op{l)] + 

By assumption R.2, the norm of V0m{On) is bounded away from zero, and G„(^„) is 
stochastically bounded by Condition R.l. Since s(6') and k are stochastically bounded 
by R.3, y/nqn is also stochastically bounded. 

Noting that m{On) < 0, it follows that ^/n{On — On) = Op{l). Therefore 

sup y/nd{0,RLR) = Op(l) 

so that for any sequence Sn such that y/n6n — ?■ 00, we have d{0, RLfi)/Sn — > 0. Hence 
9o C -R^'^ w.p.a.l, which concludes the proof □ 

A. 5. Proof of Corollary [21 This corollary immediately follows from the assumed con- 
ditions and Corollary 1. □ 

A. 6. Proof of Theorem 14]. In what follows let A denote an absolute positive constant. 
We have by definition of the Kantarovich-Rubenstein metric that 

^QvMV*)] - EqvHV)] = Op(l) uniformly in G BLi(C(0)). 

This implies that 

Eq,, H[V*]l)] - Eq^[^{[V]1)] = Op{l) uniformly in ^ G BLi(C(e)), 

since the composition (y9 o [-J^ g A ■ BLi(C(6)) for ip G BLi(C(6)). This further implies 
that 

EQ^^[^'{snp[V*]l)] - EQ^[y,'{sup[V]l)] = Op(l) uniformly in ^' G BLi(M), 



42 

since the composition (^'(sup^^ [-J^) G A ■ BLi(C(6)) for Lp' G BLi(R) and i?„ denoting 
any sequence of closed non-empty subsets in 9. Therefore, by the Extended Continuous 
Mapping Theorem, 

Eq^, [y.'(sup[V*]2+)] - EQ,[^'isnp[V]l)] = o,(l) uniformly in ^' G BLi(M). 

900 900 

(Note that here we compute expectations over V* and V taking dOo as given; note that 
our bootstrap method treats dOo as fixed.) Also note that any sequence of sets i?„ 
converging to a set R, we have that 

|Eg,[^'(sUp[l^]^)-^'(sUp[\/]^)]| 

Rn R 

< Eq^[\ snp[V]l - sup[y]^| A 1] = Op(l) uniformly in (f' G BLi(M), 

since 

sup[y]^ - sup[y]^ = Op(l) 

Rn R 

by stochastic equicontinuity of the process V. Since by Condition R.1-R.3 and Theorem 
[31 dOo converges to dQo in the Hausdorff distance, we have by the Extended Continuous 
Mapping Theorem: 

\EQy[ip'{snp[V]l) - ip'{snp[V]l)]\ = Op(l) uniformly in cp' G BLi(R), 

900 

where Qv computes the expectation over V, treating dQo as fixed. 

Combining the steps above, we conclude by the triangle inequality that: 

|EQ,J<^'(sup[y*]^) - ^'(sup[K]^)]| = Op(l) uniformly in ^' G BLi(R), 

900 

which is the same as 

PK{Qs*,Qs) = Op{l). 

(Note that the bootstrap random variable S* is computed having fixed dQo.) 

It is known that the convergence Pk{Qs„, Qs) = o{l), for any sequence of laws Qs„ 
of a sequence of random variables Sn defined on probability space {Q',J^',Pn) implies 
the convergence of the distribution function 

PlQ,JSn<s]=PTQ,[S<s] + 0{l), 
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at each continuity point (0, oo) of the mapping s i— Pr[i5 < s] and also convergence of 
quantile functions 

inf{s : Ftq^^ [Sn <s]>p} = inf{s : Fr q,[S < s] > p} + o(l) 

at each continuity point p of the mapping s inf{s : PrQ^[5 < s] > p}. Recall from 
Theorem 1 that the set of continuity points necessarily includes the region (0, 1/2). 



van der Vaart 



By the Extended Continuous Mapping Theorem (see e.g. Theorem 18.11 in[ 
( 119981 )) we conclude that since Pk{Qs*, Qs) = Op(l), we obtain the convergence in prob- 
ability of the distribution function 

PiQ,4S* <s]=PrQAS<s]+o,{l), 

at each continuity point (0, oo) of the mapping s i— t- Pt[S < s] and also convergence in 
probability of the quantile functions 

inf{s : PrQ^. [S* < s] > p} = mf{s : Ptq^S < s] > p} + Op(l), 

at each continuity point p of the mapping s i— )■ inf {s : Prg^ [5 < s] > p}. □ 



A. 7. Proof of Proposition [H First note that 7hj and 7mf are continuous, differ- 
entiable functions of the elements in f , S, and B, so that C.l follows from a CLT for 
vec(S), vec(-B) and the delta-rule. 

Next, we check Condition C.2 for the HJ bounds. Recall that the moment function 
defining the HJ bound was given by 

mnjiifJ', 0-)', iHj) = VSvvP'^ - 25^1/1 + 5*11 - a 

with Svv, Syi, Su as defined in section 1. 

Since the derivative of rnHj{{l^,CF)' with respect to a is equal to minus one, the 
lower bound on the norm of the gradient holds for all ^ G O. Also, since the eigenvalues 
of S are bounded away from zero, and the elements of \v\ are bounded, l^^^l and l^^il 
are also bounded. Furthermore, we can easily verify that 

C C C2 

OvvfJ' — ^Oyip + Oil ^ • 

for all values of fi. Now, since the eigenvalues of are bounded away from zero, and 
v'vVjyflN — {^'n'^Y is bounded from below by a positive constant by assumption, we can 
also bound S^^/i^ — 2S'„i/i + away from zero. 
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Noting that the parameter space 6*^ x F is compact, it follows that the second deriva- 
tives of m^^'^\9, 'jHj) with respect to jhj and 9, respectively, are bounded. In particular, 
this implies Lipschitz continuity of the gradients V^m^^'^\9, 'Jhj) and V em^^'^\9, 'Jhj)- 

The arguments for the moment function mMF{9,'yMF) for the MF mean- variance set 
for asset portfohos are completely analogous. □ 

A. 8. Proof of Corollary [3l First note that by Proposition [1], Conditions C.l and C.2 
hold. Hence, in order to prove this corollary it suffices to show that 

PK{Qt'z*,Qt'z;Cm = o,ii). 

Without loss of generality we can take sup^ 11^(6*) || < 1 and sup^ 11^(6*) || < 1. The claim 
will follow from 

pk{Qvz', Qt'z; cm < PKiQt'z*, Qt'z; cm + pk{Qvz, Qfz; cm = Op{i). 

That PK{Qt'z*j Qt'z'i C{Q)) = Op(l) follows immediately from pk{Qz*, Qz) = Op{l) and 
ifi?-) G BLi(R'=). Indeed, 

\y^{t{9)'z{9)) - ^{t{9)'z{9))\ < sup \t{9)'{z{9) - z{9))\ A 2 

e 

< [(sup II sup II ^(^) - z{9) II ) A 2] 

e e 

< [sup II ^(^) -^(0) II A 2]. 

e 

That pxiQfz^ Qt'z] C{Q)) = Op(l) follows because uniformly in G BLi(C(6)) 

\EQMt'Z)]-^it'Z)\ < EQ^lsnp \iti9)- my Z{9)\ A 2] 

e 

< Eq,[sup ||f(^) - mW sup ||Z(^)|| A 2] = 0,(1), 
e e 

where Eq^ computes the expectation over Z, treating t as fixed. □ 

A. 9. Proof of Proposition [Si For the LR-type statistic, it is sufficient to notice that 
Cn{9]ri) = Cniv^^ivi^))) = ^^8)^^^ ' therefore only depends only on quantities 
evaluated at 9. Hence, 

£„ = sup £„(6') = sup Cniv'^iv)), 

and 4'Ln{9] r]) is invariant with respect to parameter transformations. Asymptotic simi- 
larity follows from Condition R.l and the continuous mapping theorem. 
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Next, define ifo,<5 = {^7 ^ ri{Q^) : fh{ri) < 0}, noting that Hqs = vi'^o,s)- We now 
distinguish three cases regarding the hmit point of the sequence 6'„: (1) 6*0 G int Go, 
the interior of Go, (2) 6 E G/Gq, and (3) 6 E dQo. 

Since the weighting functions w{6; rf) were assumed to satisfy Condition R.3, Theorem 
[3] imphes that in case (1), (pwn{0',v) ~^ values of r], including the identity 

transformation, so that by continuity of VV„(0; r/) in 6, we have 

lim P{4>wn{0n]r]) 4>Wn{0n]r])) 

n—^oo 

< max{ lim P(0iy„(^o; v) = 1), 1™ P{(f)wn{0o) = 1)} = 0. 
Similarly, in case (2), we have 

lim F{(f)wni0n;'n) (t)Wn{On\V)) 
n— ^-oo 

< max{ lim P{(t)wn{Oo\ r/) = 0), lim P(</.h'„(^o) = 0)} = 0. 

Finally, consider the third case in which 9q E OQq : Using the expansion in Lemma HI we 
obtain that 



n 2 



rf(r/(^n),//o,, 
w{dn]r]) 



-\ 2 



H 2 



|V^m^(6'„; 



+ Op(l) 



(A.6) 



Now suppose we split the sequence 9n into three (possibly trivial) subsequences 9q^ , , 
and 9s„, respectively, such that \/nm{9q^) = 0(1), y/nm{9r„) oo, and y/nm{9s^) — )■ 
— oo. We can now analyze the behavior of (pwni') separately along each of these subse- 
quences. By Theorem [3l 

r„;^) = 0) = lim P{(j)ws„i0s„;v) 



lim P{4>wr 

n—^oo 







for all ?7, so that by the same arguments as before, the test is asymptotically invariant 
along these subsequences. 

Finally, consider the sequence 9q^: if w{9;r]) = yy m{'!){e))\\ ~^ *^p(-'-)' expansion in 
( ]A.6|) together with the continuous mapping theorem implies that along the subsequence 
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KOq 



+ Op{l + 



so that the leading term of this expression is a function of 6 which does not depend on 
V- 

Now note that for 6 G dQo we have m{6) = 0, so that by this expansion and stochastic 
equicontinuity of G{6) we have 



Wniv) ■= sup Wnie;v) 

6)6900 



sup 

eedeo 



G{9) 



where the hmit does not depend on r]. It follows that kwrji^ — a) = kw{^ — «) for all i], 
so that (f)wn{S]v) is asymptotically invariant along 6q^ with respect to transformations 
T] G H*, which establishes the second conclusion. 

To establish the last claim, consider the Wald statistic in (12 ■4p with weighting function 
'^(^) ~ ||y^^|g)|| + Op{l). By Condition R.l and the approximation in ( 1A.5I) . Wnid) 
converges in distribution to max{0, Z}^, where Z ~ A^(0, 1), for all values of 6* G OQq 
and is therefore asymptotically pivotal on the boundary of Gq. Hence for any fixed 
critical level k, the limit lim„^oo P (VV„(^^) > k) is constant across all values of ^ G OQq 
for any value k. 
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Figure 1. Estimated HJ Bounds with Stochastic Discount Factors im- 
phed by CRRA preferences, data labels represent values for the lES g 




mean 



Figure 2. Estimated HJ Bounds, Mean-Standard deviation pairs based 
on Consumption SDF, and Bootstrap Draws 
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Figure 3. 95% Confidence Set for tlie HJ Set using LR Statistic. Tlie 
95% confidence set for mean and standard deviation of tlie consumption- 
based SDF was also constructed using the LR Statistic for the moment 
equality defining the corresponding mean-variance pairs. 
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Figure 4. 95% Confidence Set for the HJ Set using Unweighted LR Statistic 
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Figure 5. 95% Confidence Set for tlie HJ Set using Weiglited W Statistic 




Figure 6. 95% Confidence Set for tlie HJ Set using Unweigfited W Statis- 
tic (directed Hausdorff Distance) 
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Figure 8. Bounds on Structural Labor Supply Elasticity based on Esti- 
mates from Different Studies, and their Smoothed Intersection Bound 
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Figure 10. Smoothed Bound with 95% Confidence Set for the plausible 
pairs of the structural elasticity e and friction 5 using Unweighted W- 
Statistic 



